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(57) Abstract 

The present application makes use of a novel 
noise-predictive maximum -likelihood (NPML) data 
detection scheme (10) operating on signal samples 
received via an equalizing filter (22) from a channel, 
and in particular a storage channel of a direct access 
storage device. This scheme arises by applying a noise 
prediction/ whitening process to the output signal of said 
equalizers and by providing means in the branch metric 
computation of a maximum-likelihood sequence detector 
(MLSD). It furthermore* provides for cancellation of 
intersymbol- interference (ISI) components of said signal 
samples, by means of an appropriate table look-up. The 
contents of the table look-up are addressed by using 
decisions from the path histories of the maximum-likelihood 
sequence detector. 
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DESCRIPTION 



APPARATUS AND METHOD FOR NOISE-PREDICTIVE MAXIMUM- LIKELIHOOD (NPML) 
DETECTION 



TECHNICAL FIELD 



10 The invention relates to data detection methods and apparatus, particularly 
methods and apparatus for partial-response signaling and 
maximum-likelihood sequence detection. It further relates lo direct access 
storage devices (DASDs) based on these methods. 

15 BACKGROUND OF THE INVENTION 

Application of partial-response (PR) class-IV (PR4) equalization and maximum 
likelihood sequence detection (MLSD) has been shown in theory and practice 
to achieve near optimal performance at recording densities of 

20 0.8 < PW50/T < 1 .6. where PW50 is the pulse width at the 50% amplitude 
point of the channel's step response and T is the duration of the channel 
encoded bit. A partial response maximum likelihood (PRML) system for the 
magnetic recording channel has been described in A PRML system for Digital 
Magnetic Recording." Roy D. Cideciyan et al.. IEEE Journal on Selected Areas 

25 in Communications. Vol. 10. No. 1. pp. 38 - 56, January 1992. In the US 
patent No. 4786.890 a class-IV PRML channel using a run-length limited (RLL) 
code has also been disclosed. 

At high recording densities, i.e.. PW50/T 1 .6. the linear partial-response 
30 class-IV equalizer leads to substantial noise enhancement As a consequence, 
the performance of the PRML detector suffers and may become inadequate to 
meet product specifications. Application of extended partial-response 
maximum likelihood (EPRML) detectors has been shown in theory and 
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practice to achieve better performance than PRML detectors in the range 
PW50/T > 1.6. The patent application GB-A-2286952. published on 30 August 
1995. discloses a novel EPRML scheme for data detection in a direct access 
storage device. The novel architecture of the invention claimed therein allows 
for the addition of EPRML detectors to PRML channels with only minor 
changes to the overall channel architecture. 

The optimum MLSD receiver for detecting an uncoded data sequence in the 
presence of intersymbol-interference (ISI) and additive Gaussian noise 
consists of a whitened-matched filter followed by a Viterbi detector which 
performs maximum likelihood sequence detection on the ISI trellis, as 
described by G. D. Forney in Maximum-likelihood sequence estimation of 
digital sequences in the presence of intersymbol interference." IEEE Trans. 
Inform. Theory. Vol. IT-18. No. 3. pp. 363 - 378. May 1972. For the magnetic 
recording channel the state complexity of this trellis is given by 2- where L 
represents the number of relevant ISI terms in the output signal of the 
whitened-matched filter. In the patent application W094/29989 with title 
"Adaptive noise-predictive partial-response equalizing for channels with 
spectral nulls." filed 14 June 1993 and published 22 December 1994. and in 
reference Noise predictive partial-response equalizers and applications." P. 
R. Chevillat et al.. IEEE Conf. Records ICC'92. June 14-18 1992. pp. 
0942 - 0947. it was shown that a partial-response zero forcing equalizer 
cascaded with a linear predictor whose coefficients have been suitably 
chosen, is equivalent to the whitening discrele-time prefiller of the optimum 
25 MLSD receiver. Furthermore, in the same patent application a receiver 
structure has been disclosed where the prediction process has been 
imbedded in the Viterbi detector corresponding to the partial- response trellis. 
The above patent application W094/29989 is primarily concerned with wire 
transmission systems. 



15 



20 



30 



In the above patent application W094/29989 and the article of P. R. Chevillat 
et al. it has been concluded that noise-prediction in conjunction with PRML 
improves detector performance. 
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1 It is an object of the present invention to provide a method and apparatus with 
improved data detection performance. 

It is an object of the present invention to provide a method and apparatus for 
5 improved data detection in direct access storage devices with the purpose to 
overcome the performance problems in prior art schemes. 

It is an object of the present invention to provide a method and apparatus to 
achieve higher linear storage density in direct access storage devices 
10 (DASDs). 

It is another object of the present invention to provide a method and 
apparatus which can be employed in a conventional PRML/EPRML direct 
access storage device without changing the principal architecture of the 
is -electronic channel. 

SUMMARY OF THE INVENTION 

20 The above objects have been accomplished by providing an entire family of 
estimation detectors which can for example be used for data detection in 
DASDs. Some of the present detectors, which make specific use of properties 
of the magnetic recording channel, arise by imbedding a noise 
prediction/whitening process into the branch metric computation of the 

25 maximum-likelihood sequence detector and are collectively called Noise 
Predictive Maximum Likelihood (NPML) detectors. They furthermore comprise 
means for cancellation of intersymbol-interference (IS!) components by an 
appropriate table look-up. In contrast to the patent application W094/29989 
and the article of P. R. Chevillat et al. where the state complexity of the 

30 detector is fixed and determined by the partial-response trellis, the NPML 
detectors have a state complexity which is equal to 2 K . where 0 < K < L and L 
reflects the number of controlled (known) Intersymbol Interference (ISI) 
components introduced by the combination of PR equalizer and predictor. 
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The special case where K = L is equivalent to the optimum MLSD detector 
for the given predictor length and the special case K = 0 corresponds to a 
noise-predictive PR equalizer followed by a memoryless detector. For 
1 ^ K < L the NPML detector operates on a reduced set of ISI states. At the 
same time, the (L-K) ISI terms (components) not represented in the 
state-space of the NPML detector are compensated in a decision-feedback 
fashion by using decisions from the path history. Thus, the NPML detectors 
offer a trade-off between performance and state complexity and/or length of 
decision-feedback and they provide a substantial gain in linear recording 
density over PRML and EPRML detectors. In addition, the present 
implementations of NPML detectors do not require multiplications in the 
imbedded predictor and thus allow simple random access memory (RAM) 
look-up implementation for ISI cancellation. Furthermore. NPML detectors 
generally do not exhibit quasi-catastrophic error propagation. Thus, additional 
-increases in recording density can be achieved with higher rate run-length 
limited (RLL) codes by relaxing the constraints relating to the survivor path 
memory. Finally, besides modularity and substantial gains in performance, the 
NPML detectors have the important implementation advantage that they can 
be "piggy-backed" on existing PRML/EPRML systems. Therefore, there is no 
need for the development and implementation of an entirely new channel 
architecture which is a costly and complex task. 



Also described and claimed are low complexity derivatives of the NPML 
detector family which offer appreciable performance gains. The respective 
25 schemes include, but are not limited to. two-state interleaved NPML detectors 
and cascaded noise-predictors with PRML detectors. Furthermore, derived 
from an NPML scheme with a single-tap predictor, a programmable 8-state 
NPML detector is described which is capable to operate also as a PRML or 
EPRML detector. 



30 
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! DESCRIPTION OF THE DRAWINGS 

AND NOTATIONS USED 

The invention is described in detail below with reference to the following 
5 drawings: 

FIG. 1 Shows a block diagram used to illustrate how the inventive NPML 

detectors fit into the existing PRML channel architecture. 

10 FIG. 2A Shows the blocks of Figure 1 which are relevant for the present 
invention: the digital equalizer 22. the present NPML detector 10, 
and inverse precoder 23. 

FIG. 2B Shows an equivalent form of the present NPML detector 10, 
15 * according to the present invention. 

FIG. 2C Shows another equivalent, more detailed, form of the present 
NPML detector 10. according to the present invention. 

20 FIG. 2D Shows yet another equivalent form of the present NPML 
detector 10. according to the present invention. 

FIG. 2E Shows another possible embodiment of a sequence detector with 
imbedded feedback, according to the present invention. 

25 

FIG. 3A Shows the noise-predictive part using a memoryless detector in 
cascade with a conventional PRML detector, according to the 
present invention. 

30 FIG. 3B Shows another approach to realize the noise-predictive part 
using a memoryless detector in cascade with a conventional 
PRML detector, according to the present invention. 
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FIG. 4 Shows a block diagram to illustrate the operation of the metric 

update unit (MUU). for some state s, at the time nT, according to 
the present invention. MMUs are major functional blocks in an 
NPML detector. 

FIG. 5 Shows a 2-state trellis diagram. 

FIG. 6 Shows an implementation of a 2-state NPML detector with a 4-tap 

predictor, according to the present invention. 

FIG. 7 Shows a 2-state trellis (difference metric) diagram. 

FIG. 8 Shows one possible way of mapping the algorithm implied by the 

trellis diagram in Figure 7 into hardware, where the threshold for 
15 ' the comparators is provided by the stored difference metric D n .. 

FIG. 9 Shows a 4-state trellis diagram. 

FIG. 10A-10C Shows another implementation of an NPML detector, according 
20 to the present invention (4-state. 2-tap predictor) 

FIG. 11A-11C Shows another implementation of an NPML detector, according 
to the present invention (4-state. 4-tap predictor) 

25 FIG. 12 Shows an 8-state trellis diagram with N=1 and K = 3 (8-state. 
1-tap predictor). 

FIG. 13 Shows a transformed 8-state trellis diagram with N = 1 and K = 3 
(8-state. 1-tap predictor). 

30 
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» FIG. 14 Shows one possible way of mapping the algorithm implied by the 
trellis diagram in Figure 13 into a hardware structure. The 

survivor path memory controlled by the select-signals SO S7, 

is not shown, 

5 

FIG. 15 Shows an alternate form of implementing the functions of the 
2-state NPML detector with a 4-tap predictor shown in Figure 6. 

FIG. 16A-16C Shows another implementation of an NPML detector, according 
»o to the present invention (4-state. N-tap predictor). 

FIG. 17A-17C Shows another implementation of an NPML detector, according 
to the present invention (4-state. N-tap predictor). 

15 

GENERAL DESCRIPTION 

In the following the principal forms of implementation of NPML Detectors are 
described. 

20 

The block diagram in Figure 1 shows how the present NPML detectors 10 fit 
into the existing PRML channel architecture. Customer data L are written in 
the form of binary digits a,e { -1. 4-1} by write head 15 on the disk 11 after 
being encoded in an encoder 12 by a rate-8/9 RLL code, serialized in a 

25 serializer 13 and precoded in a precoder 14 by a 1/(1 : hD ? ) operation where D 
is the unit delay operator. When retrieving the customer data from said disk 
11. an analog signal r(t) is generated by the read head 15 and provided at the 
read-head s output. This signal r(t) is then applied via the arm electronics 16 
to a variable-gain amplifier (VGA) circuit 17. The output signal of the VGA 

30 circuit 17 is first low-pass filtered using an analog low-pass filter 18 (LPF) and 
then converted to a digital form x ft by an analog-to-digital (A/D) converter 19. 
The functions of the A/D converter 19 and VGA unit 17 are controlled by the 
timing recovery and gain control loops 20 and 21, respectively. The analog 
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low-pass filter 18 is preferably a filter which boosts the higher frequencies to 
avoid saturation of the A/D converter 19. The digital samples x n at the output 
of the A/D converter 19 (line labeled A in Figure 1) are first shaped to PR4 
signal samples by the digital equalizer 22 (line labeled B in Figure 1) and are 
then passed on to the inventive NPML detector in the form of digital samples 
y„. After inverse preceding by means of a precoder 23 performing a (1<£D : ) 
operation, the output data of the NPML detector 10 (i.e. the final decisions: 
line labeled C in Figure 1) are fed via a deserializer 24 to a decoder 25 for the 
rate-8/9 RLL code which delivers the retrieved customer data !„.„. The 
inverse precoder function following the NPML detector in Figure 1 can be a 
separate functional block (as shown) or il can be imbedded in the trellir, 
(survivor path memory) of the detector. Figure 2A shows the blocks in 
Figure 1 which are relevant for the present invention: the digital equalizer 22. 
the NPML detector 10. and the inverse precoder 23. 

Generally, the coefficients of the digital equalizer 22 can be optimized so that 
the overall transfer function, including the head/disk-medium characteristics 
and the analog LPF 18. closely matches any desired system polynomial of the 
generalized partial-response form f(D) = d - f,D' - -r fpD 1 ") where the 
coefficients f can be arbitrary real numbers. For example, the 
partial-response (PR) polynomial for class-4 PR systems (PR4) is 
f(D) = (1 - D 7 ). Similarly. Ihe polynomial for extended partial-response 
class-4 (EPR4) systems is f(D) = (1 - D ? )(1 • D) =.- ( 1 -;■ D — D ? — D 3 ). A 
further example is f(D) = (1 - 0.1D - 0.9D ? ). 

Figure 2B shows the basic structure of the NPML detector 10 in the form of a 
prediction error filter 41 cascaded with a sequence detector (SD) with 
imbedded feedback (FB) 30. 

In the sequel we use PR4- equalized signals y, (Line B in Figure 2B). 
however, the inventive scheme can be applied to any shaping performed by 
the equalizer 22 in Figures 2A and 2B. 
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1 Figures 2C and 2D show two equivalent forms of NPML detectors, according 
to the present invention. Its basic principle can be explained as follows. Let y n 
be the output of the PR4 digital equalizer (line labeled B in Figures 1. 2C 
and 2D). This output then consists of a PR4 data signal and colored noise 

5 (colored interference components), i.e., 

~ a n - a n -2 + W n (1) 

10 where a„e { — 1. +1} denctes the encoded/precoded data sequence written on 
the magnetic medium with a rate 1/T and w n represents the colored noise 
sequence at the output of the digital equalizer 22. The power of the colored 
noise component (colored interference component) can be reduced by noise 
prediction. If p(D) = (p t D % + p 2 D 2 + ... + p N D N ) denotes the transfer 

15 polynomial, or equivalently E(D) = 1 -P(D) denotes the transfer polynomial of 
the prediction error filter, of the N-tap minimum mean-square (MMSE) 
predictor of the noise sample w n then the signal 



20 



25 



N 

- > w n _,p : - 

i*r5 (2) 

Z> 

i = 1 



= (y n -a n ha n _ 2 ) - ) (y n _ j -a n _ i H--2iP. 



represents the prediction error or equivalently the whitened noise component 
of the PR4-equalized output signal y„. Reliable operation of the 
prediction/whitening process is possible by using decisions from the path 
history associated with each state which is available in a sequence (Viterbi) 
30 detector. In that sense, NPML detectors are MLSD detectors for (PR) signals 
with imbedded prediction or equivalently imbedded feedback. 
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In view of (1) and (2) the branch metric of the NPML detector 10 for 
PR4-equalized samples corresponding to a transition from state s, to state s k 
takes the form 



Ms, s k ) = 



N 



Vn " 2J y n ~ i " 9 n - i< S j» H " 3 n - i - 2< S j^ Pi ~ a n + ^ - 2 
i — I 



(3) 



10 



15 



where the terms a 0 ,(s,). a, . ? (s.) represent past decisions taken from the 
path history associated with state s, and a.,, a., : are determined by the 
hypothesized state transition s,->Su. Clearly, the noise prediction process 
appears explicitly in the branch metric computation of the Viterbi algorithm 
implementing the NPML detector. Furthermore, it can be seen that by setting 
the predictor coefficients p, equal to zero, the branch metric in (3) becomes 
the branch metric of the 4-state PRML detector. 



The branch metric in (3) can also be written as 



20 



/(s j<Sk ) = 



N 

_ V 



i = 1 i=1 



(4) 



25 



By noticing that the first sum in (4) is state independent, and after some 
rearrangement of the remaining terms, the equivalent branch metric is 
obtained as 



30 



/MS:, S k ) 



z M + 



N + 2 

V . 

i = k + i 



K 

4 V, 



i = i 



(5) 



where the signal sample z., = y„ - ^y„ .p, is the output of the prediction error 
filter 41, shown in the equivalent NPML implementation of Figure 2C. and 
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{g 1f i = 1 t 2,... . N + 2} are the coefficients of the imbedded feedback filter 42 
(FIR: finite impulse response or RAM-based filter) in Figure 2C. It can be 

shown that the coefficients <g ( . i = 1.2 N 4- 2} introduced in (5) are the 

coefficients of the polynomial 

g(D) = (+ 1 -g,D'-g 2 D 2 g N . 2 D N 1 2 ) - (1 - D 2 )(1 - P(D)) - (1 - D 2 )E(D). 

The effective ISI memory L of the PR4-based NPML system is thus L = N + 2. 
The symbols a„. ,(s,) in the first summation term of (5) represent past decisions 
taken from the path history associated with state s,. whereas the symbols a„_, 
in the second summation term of (5) represent state information. Clearly, by 
increasing K we effectively increase the number of states of the NPML 
detector and decrease the length of the imbedded decision feedback. 
Conversely, by decreasing K the number of states is decreased at the 
expense of increasing the length of the imbedded decision feedback. Thus, the 
emerging family of NPML detectors, in accordance with the present invention, 
•offers a trade-off between state complexity and length of imbedded decision 
feedback. 

The two equivalent implemenlations of the NPML detector 10 shown in 
Figure 2C and 2D, respectively, require no change of the signal processing 
blocks, i.e.. VGA 17, analog LPF 18. digital equalizer 22. timing recovery and 
gain control loops 20 and 21. of the current PRML/EPRML channel 
architecture used by IBM and others. Any member of the family of NPML 
detectors according to the present invention can either replace the 
PRML/EPRML detector or operate concurrently with it. 

A third possible implementation of an NPML scheme in the form of a filter 
cascaded with a sequence detector with imbedded feedback is shown in 
Figure 2E. In this case the combination of digital equalizer 22* and prediction 
error filter 41 (see Figure 2B) is replaced by a single finite-impulse response 
filter designated as FIR1 51. Input to the filter FIR1 51 now are the 
unequalized samples x n at the output of the A/D converter 19 (line labeled A in 
Figure 1 and Figure 2E). The filter FIR1 51 has the property to whiten the 
noise and introduce a controlled amount of ISI in the signal samples z„ at its 
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output. The coefficients of the feedback filter (FIR2 or RAM 52) are then used 
in the branch metric computation of the sequence detector with imbedded 
feedback in the same way described above. Thus, the branch metric takes the 
form 



N + 2 K 



i ~ K + 1 ,^T | 



(6) 



where z„ is the output of FIR1 51 and >b.. i = 1.2.. . N 2\ is the set of 
coefficients of the filter FIR2 52. Note that expressions (5) and (6) are 
essentially the same. It can be shown that for infinitely long filters the three 
alternative implementations of sequence detection with imbedded feedback, 
shown in Figures 2A - 2E. are equivalent. 

It should be understood that the NPML principles described hereinabove can 
be applied to any form of system polynomial f(Dh In the sequel, however, 
only the PR class-IV polynomial (PR4) will be considered as the target 
polynomial. 

Performance and Preferred Parameters for NPML Detectors Used in DASD's: 

The error performance of a magnetic recording system employing NPML 
detection has been studied by computer simulations in order to determine the 
25 appropriate parameters N (number of predictor coefficients) and K (length of 
detector memory defining the number of detector states 2") to be used in a 
practical system. In particular, the cases described in this document for N = 1. 
N=2. and N=4 predictor coefficients lead to preferred NPML detectors. 



Two low-complexity derivatives of the NPML detector family have also been 
investigated. Both schemes, like the entire family of NPML detectors, require 
no change of the signal processing parts of the current PRML channel 
architecture (see also Figure 1). Figure 3A shows the noise-predictive part 
using a memoryless detector in cascade with a conventional PRML detector. 
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t The colored noise component of the PR4-equalized signal (line labeled B in 
Figure 1 and 3A) is first whitened by a predictor. Note that instead of 
imbedding the predictor into the MLSD process, a 3-level ( + 2. 0, -2) 
memoryless detector provides the (tentative) PR4 (signal sample) decisions 

5 needed for the whitening process. The PR4-equalized samples corrupted with 
the whitened noise components are then fed to a conventional PR ML detector 
and inverse precoder to obtain improved final decisions. Figure 3B is an 
equivalent form of the scheme in Figure 3A similar to the equivalent forms in 
Figure 2C and 2D. respectively. 

10 

The second low-complexity NPML detector scheme is based on the fact that 
PR4 sequences can be viewed as two independent, interleaved dicode 
sequences with polynomial (1 — D'). where D' refers to a delay of 2T. In this 
case each dicode sequence at the output of the digital equalizer (line labeled 

15 "B in Figures 1. 2C. and 2D) can be described by a 2-state trellis. The Viterbi 
algorithm, operating separately on each of these 2-state interleaved trellises, 
will use the branch metrics given in (3) or (5) where the time indices are either 
even or odd. For example, while the Viterbi algorithm operates on the even 
trellis, the time indices of the branch metric expression (3) or (5) will be even 

20 whereas the contribution of the odd past decisions in whitening the noise will 
come from the path memory with the best metric of the odd trellis. 



A further suboptimal scheme is to find the state with the best metric, compute 
the predictor output using the decisions from the survivor path corresponding 
25 to this best state, and applying it as a feedback term in the metric update 
computations for all states. This approach has the advantage that only a 
single RAM is needed. 

Concept of NPML Detectors with Nonlinear Predictors: 

30 

The NPML concept described herein is also applicable when the noise 
predictor has certain nonlinear characteristics and/or the computation of the 
predictor coefficients is based on a different noise model. 
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The present NPML architecture allows for great flexibility in optimizing the 
noise predictor function with respect to various kinds of random noise which 
occur in a practical target system. For example, only a portion of the total 
noise in hard disk drives is adequately modelled by additive white Gaussian 
noise (AWGN). Besides AWGN. the total noise includes other noise sources, 
such as signal-dependent disk noise, noise due to texture scratches, and so 
forth. In addition, to a certain extent coherent interference, such as clock 
and/or adjacent track signals, may also exist in the analog readback signal. 



»o Because the NPML concept allows, in effect, the transfer function for the 
signal portion of the input to be different than that for the noise and other 
interference components also present in the signal, the predictor may be 
optimized to minimize signal disturbances due to any type of corruptive 
source. Conventional detectors (such as PRML and EPRML detectors, etc.) 

is are only optimized to the extent that the signal disturbance at the detector 
input is additive, random, uncorrelated and Gaussian. This is often, a poor 
approximation in practical DASD systems: thus, using a linear predictor 
and/or computing the predictor coefficients based on this idealistic noise 
model may not lead to optimal solutions in situations where this assumption is 

20 poorly matched. 

In hard disk drives, both AWGN and so-called 'disk noise" are dominant 
sources of readback signal corruption. In the following, an example is given 
for a linear noise predictor with four coefficients (N=4) where the coefficients 

25 have been computed by incorporating AWGN as well as disk noise in the 
noise statistics. A simple model of disk noise is the so-called "transition jitter 
model" wherein the deviation of each written transition from its nominal 
location is a random variable. The effective SNR achieved af the input of a 
PRML detector is 15.4dB and at the input of a 64-state NPML detector 18.9dB 

30 for AWGN alone for a channel operating at PW50/T = 3. In case of AWGN 
combined with disk noise (transition jitter) the effective SNR at the input of a 
PRML detector is 12.7dB and at the input of a 64-state NPML detector 15.5dB 
for a channel operating at PW50/T = 3. It is interesting to note that the NPML 
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i detector is able to adapt the predictor coefficients to different noise statistics 
and thus to maintain an SNR margin of 2.8-3. 5dB over PRML. Although this 
example uses a 4-tap linear noise predictor for NPML, this technique and its 
benefits is herein claimed for ail possible types of noise predictors, including 

5 nonlinear predictors. 

Examples of Preferred Embodiments of NPML Detectors: 

The preferred form of implementation of an NPML detector within a PRML 
system is the one given in Figure 2C. More details on this embodiment of an 

10 

NPML detector 10 are given in this section. Figure 4 illustrates the operation 
of a major functional block in an NPML detector according to Figure 2C - the 
metric update unit (MUU) 68 shown here for state Si at time nT. Figure 4 
illustrates the required time relations between inputs and outputs of the 
various functional blocks. A separate MUU function must be provided for 

15 

each hypothesized state Sv. k = 1.2 2 K , where Ke{1.2 L} and L is the 

number of controlled !SI terms, e.g. for PR4 L = N + 2. In high-performance 
DASDs parallel MUU hardware must be provided for each state to meet the 
data throughput requirements; in principle, however, hardware could be 
shared if speed constraints permit. Furthermore, it is assumed here and 

20 

hereafter that the survivor path memory (SPM) 61. as shown for example in 
Figure 4, is implemented by using the register-exchange method, as for 
example described in the patent application GB-A-2286952. published on 30 
August 1995. 

25 

The branch metric (BM) units in a conventional MLSD (Viterbi) detector 
require only signal sample inputs, obtained directly from the equalizer (signal 
labeled B in Figure 2C). As indicated in Figure 4. it is a distinguished feature 
of the NPML detectors with K < L that each BM unit 62. 63 requires signal 
samples processed by a predictor 41 (signal labeled 7_ n in Figure 2C), as well 

30 

as an additional input from FIR or RAM-based filters 64. 65 (signals Gs, and 
Gs ( in Figure 4) in the feedback path between the SPM 61 and the MUU 68. 
Note that the feedback filters 64. 65 do not have a common serial input, but 
they are loaded in parallel at every symbol interval T. The input of each FIR 
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i or RAM-based filter 64. 65 is a set of most recent past decisions taken from 
the survivor path history stored in the SPM 61 for each hypothesized state 
(i.e.. s, and s.. respectively, in Figure 4). The add-compare-select (ACS) unit 
66 in Figure 3 adds the branch metrics to the state metrics Ms, and Ms,, 

5 respectively, compares the results, selects the survivor metric Ms* and 
provides the update signal Ss k for the corresponding decision path in the SPM 
61.. The SPM 61 produces final decisions at output line 67 with a delay of dT 
seconds relative to time nT. It is a further feature of the present NPML 
detector that the delay parameter d can generally be made shorter compared 

io to that of a conventional MLSD detector designed for PR signaling (i.e., PR 
signaling schemes with spectral nulls, such as PR4). 

NPML Detector Using Four Predictor Coefficients 
(N = 4) and Two States (K=1): 

For N = 4 and K= 1 the branch metrics based on (5) become 



20 



6 

i = 2 



25 



where the signal sample z„ = y„ - ^ y n ,p. is the output of the prediction 

• - i 

error filter 41. Associating the data symbols +1' and -1" with the binary 
numbers 1 and 0, respectively, the state information a„ . = — 1) is 

mapped into the present state s, = 1(0) and the present data symbol 
a n = -M( — 1) mapped into the next state s,, =■ 1(0). Letting 



30 



6 

i = 2 



(8) 
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G0 n _, = Va^itOJgi . (9) 



= 2 



5 one obtains the four branch metrics 

= |2 n + G1 n _, + g, - 1 I 2 . (10) 



;.{i.o) = i z n + Gi n _ , + g , + 1 : 2 . (id 

/U0.1) = | z n h- G0 n _ , - g, - 1 I 2 . (12) 
/(0.0) = | z n + G0 n _ , - g, + 1 | 2 . (13) 



where z„ are the samples obtained from the corresponding 4-tap prediction 
error filter connected in cascade with the equalizer (see Figure 2C). It will be 
20 useful to define the quantities 

Z11 n = z n + g, - 1 . (14) 



Z10 n = z n + g, + 1 . (15) 

Z01 n = z n -g, - 1 . • (16) 

Z00 n = z n -g, H- 1 . (17) 

since they can be precomputed outside the feedback loop, if necessary by 
means of pipelining. Thus. eqs. (10) - (13) can be written as 
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to 



15 



20 



25 



30 



/fl.D = |Z11 n + G1 n _, | 2 . (18) 
/(1.0) = |210 n + G1 n _, i 2 . (19) 
^(0.1) = !201 n + G0 n _,| 2 . (20) 



M0.0) = |Z00 n H-G0 n _, I 2 . (21) 

respectively. Finally, defining the stored metrics Ml,. , and M0„ , for states 1 
and 0. respectively, the trellis diagram shown in Figure 5 is obtained. The 
metrics are updated according to 



M1 n = min fM1 n _ , + /M.i): M0 n _ , ■, /(0.1 ),' . (22) 



M0 n = min {M1 n _ , -r- ;.(1.0): M0„ _ , -i- ;.(0.0)|- 



(23) 



and direct mapping of the trellis shown in Figure 5 into hardware functions 
leads to the implementation of the 2-state NPML detector with a 4-tap 
predictor 77 shown in Figure 6. It is proposed here to generate the terms 
G1„ . , and G0„ ,. defined by (8) and (9). respectively, by means of RAM-based 
filter structures 71. 72 which can be loaded with the appropriate (five) path 
history decisions. Also indicated in Figure 6 is the 2-state SPM 70 fed by the 
two comperators 58. In an alternate embodiment (not shown), the functions of 
SPM 70 and RAM-based filters 71. 72 shown in Figure 6 could be combined in 
an attempt to speed-up computation of G1„ , and GO., ... Note further, that the 
squaring functions in Figure 6. realized by means of units 73-76. can be 
approximated to simplify the required circuitry, with minimal loss in 
performance. The decision signals S1 and SO in Figure 6 are used to control 
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i the metric multiplexers 79 and the path update in the SPM 70. The selected 
metrics M1„ and MO n are stored in registers 80 and 81. respectively. 

A multitude of variations of the implementation shown in Figure 6 is possible, 
5 depending on constraints, complexity, critical timing paths, and algorithmic 
issues such as metric bounding. For example, automatic metric bounding can 
be achieved by using the conventional modulo technique in the adders 82-85 
feeding the comparator inputs 58, as described in "An Alternative to metric 
rescaling in Viterbi decoders.'' A. P. Hekstra. IEEE Transactions on 
io Communications. Vol. 37. No. 11, pp. 1220 -1222. November 1989. An 
alternate method of metric normalization can be implemented by applying the 
concept of a difference metric. Defining the difference metric 

15 _ Dn-1 = M1 n-1 "MOn- I ( 24 > 

the trellis in Figure 7 is obtained where the metrics are updated such that the 
metric for state 0 is always the zero-valued metric. Thus, the difference metric 
is updated according to 

20 

D n = min >D n „ . - ;.(1.1): /(0.1» - min{D n-l wM.0): /(0.0)} (25) 

where one can show that the cross-extension of the trellis in Figure 7, which 
would lead to the difference metric D n = /.(0.1 )-•[□„ ■ — ^(1,0)], is not 

25 

possible. Thus, only three of the four potential values ol D„ in (25) will have to 
be considered. One possible way of mapping the algorithm implied by the 
trellis description in Figure 7 into hardware is shown in Figure 8 where the 
threshold for the comparators is now provided by the difference metric D n - ■ 
stored in register 80. Figure 8 is otherwise similar to Figure 6. The 

30 

difference metric approach is useful in cases where it is not possible or 
convenient to use the conventional modulo technique which relies on 
2s-complement arithmetic for metric normalization. 

c 
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NPML Detector Using Two Predictor Coefficients 
(N = 2) and Four States (K = 2): 

For N = 2 and K = 2. i.e.. 2 K = 4 states, the branch metrics based on (5) become 



Ms r s k ) = 



A 2 

z n + Va^.fs^ + a n _ ,g, h- a n _ 2 g 2 - a n 



i = 3 



(26) 



10 



15 



where the signal sample z„ = y„ - y„ ,p, -y. ,p, is the output of the 2-tap 
prediction error filter Associating again the data symbols ' + 1" and with 
the binary numbers 1 and 0. respectively, we map the state information 
(a„. ? . a.,. ,) =r ( - 1. -1).( _ 1. .,. d,( + i _ Dand ( + 1 .+ 1) into the present 
state' s, = 0.1.2, and 3. respectively. Similarly, the next state information 
(a* a„) = ( - 1. -1),( - 1, -i- + 1. _ Dand ( -f- 1 . ~ 1) is mapped into the 
next state s> = 0. 1.2. and 3. respectively. Letting 



20 



G3 



_ V 



„_ , - 2 j a "- '' 3,g i ™ a n - 3 ,3 »93 
i - 3 



i<3)g^ 



(27) 



25 



30 



_9711544A1 _l_> 



-1 

G2 n-i = 2^ a ^-' (2,g i = a n-3'2)g 3 + a„_ 4 l2)g 4 . (28) 

i = 3 



G1„-i 



^an-.nJOi = a„ -3(1)93 *■ a n - 4 n)g 4 -- (29) 



i = 3 



GO n-i = X!a n _,(0)gi - a n _ 3 (0)g 3 -r a n _ 4 (0)g 4 . (30) 



i = 3 
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i one obtains the eight branch metrics 



10 



15 



;.(3.3) = Iz n + G3 n _, +g,+g 2 -1 |' . (31) 



M3.2) = | z n + G3 n _ , + g, + g 2 + 1 |*- . (32) 



;.(2,1) = | z n + G2 n _ , + g, - g 2 - 1 |* • (33) 



,1(2.0) = ! z n 4- G2 n _ , + g, - g 2 - 1 !' . (34) 



/(1.3) = |2 n + G1 n _, -g, +g 2 - 1 i 2 . (35) 



.4(1.2) = I z n + G1 n _ , - g, -»- g 2 + 1 | 2 . (36) 
20 /(O.H = | z n G0 n _ , - g, - g 2 - 1 ! 2 . (37) 

/(0.0) = | z n f G0 n _ , - g, - g 2 - 1 ! 2 (38) 

25 where z„ are the samples obtained from the corresponding 2-tap predictor 
filter connected in cascade with the equalizer, see Figure 2C. It will be useful 
to define the quantities 



Z33 n = z n + g, 4- g 2 - 1 . (39) 

30 



Z32 n = z n + g, + g 2 + 1 (40) 
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Z21 n = z n + g, - g 2 - 1 . 



(41) 



Z20 n = z n + g, - g 2 * 1 



(42) 



213„ = z„- 



n - 9l + 92 ~ 1 



(43) 



to 



Z12„ = 



Z n - 9l + 92 ~ 1 



(44) 



Z01 n = Zn - fll - 9 2 - 1 



(45) 



ts 



Z00 n = z n - g, - g 2 + 1 



(46) 



since they can be precomputed outside the feedback loop, if necessary by 
means of pipelining. Thus. eqs. (31) - (38) can be written as 



20 



;:(3.3) = I Z33„ + G3„ 



(47) 



/(3.2) = | Z32 n + G3 n ._ 



(48) 



25 



/K2.1) = | Z21„ - G2 n _ , 



(49) 



A (2,0) = 



Z20 n + G2 n _ , ! 2 



30 



(50) 



/(1.3) = i Z13 n + G1 n _ , 



(51) 
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>-f 1.2) = |Z12 n + G1 n _, ! 2 . (52) 

x(0.1) = |Z01 n + G0 n _, i 2 . (53) 

5 

X(0.0) = |Z00 n + G0 n _ t | 2 . (54) 

respectively. Finally, defining the stored (present) metrics Ms ln _, for each of 
10 the present states s. = 0.1.2. and 3, one obtains the trellis diagram shown in 
Figure 9. The four metrics for the next states s. =■ 0,1.2. and 3. are updated 
according to 

Ms k n = min f Ms j , + Ms- r s k ); MSj ( 4- /As r s k )} , (55) 

with s, and s, being the possible present states. Direct mapping of the trellis 
shown in Figure 9 into hardware functions leads to the scheme shown in 
Figures 10A. 10B. 10C. The terms G0 n ,. G1„ ,. G2„ and G3., defined by 

20 (27) - (30). respectively, can be generated by means of a random access 
memory 131-134 (RAM) which stores the appropriate values for the given 
coefficient g, and g 2 . depending on the chosen operating point of the channel; 
the RAMs 131-134 need to hold only four different (actually two different and 
their negative) values. The 4-state SPM 135 is a register exchange structure in 

25 case of high-speed implementation. Note that the squaring functions in (31) - 
(38) can be approximated to simplify the required circuitry, with minimal loss 
in performance. Four decision signals (SO. SI. S2. S3) are needed to control 
the metric multiplexers and the path update in the SPM 135. Automatic metric 
bounding is achieved by using the conventional modulo-2 technique in the 

30 adders 136-143 feeding the comparator inputs. 
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i NPML Detector Using Four Predictor Coefficients (N=4) 
and Four States (K=2): 

For N=4 and K = 2. i.e., 2" = 4 states, the branch metrics based on (5) become 



to 



^ 2 

X(Sj.S k ) - 



2 n + J] a n - i( s j)9i + a n - i9i + a n 2 g 2 - a n 

i = 3 



(56) 



ion 



where the signal sample z.. = y., - v y „ , p . is (he output of the predjct 
error niter. Associating again the data symbols +1" and with the 

binary numbers 1 and 0. respectively, the state information 
(a„. ? .a„ ,} = ( - 1. - 1). ( _ l .j. 1) ( + i._ n. and , K ^ 1} js mapped jnlo 
the present state s, = 0. 1.2. and 3. respectively. Similarly, the next state 
15 Jnformation |a„. ,.a„) = ( - I. - - i. + i M . L _ „ and , M f1) , is 
mapped into the next state s. = 0. 1,2. and 3. respectively. Letting 



6 

20 ,T~3 



G3 n- . = Va n _,(3) gi . (57, 



G2 



n - 1 



25 



6 

y 



an-if2)9i 



(58) 



G1 



ii - i 



6 

V 

i -= 3 



(59) 



30 



GO 



n - 1 



6 

V 

i = 3 



a„ _ i(0) gj 



(60) 
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1 one obtains the eight branch metrics 

;.(3.3) = i z n + G3 n _ , + g, H- g 2 - 1 I 2 . (61) 

5 

2 



x(3.2) = | z n + G3 n _ , + g 1 + g 2 + 1 I . (62) 



;.(2.1) = |z n + G2 n _, +g, -g 2 - 1 I 2 . (63) 



10 



15 



;.(2.0) = i z n + G2 n _ , + g, - g 2 + 1 I 2 . (64) 



A(1,3) = |z n + G1 n _, -g, +g 2 - 1 I' • (65) 

,1(1,2) = t z n + G1 n _ , - g, + g 2 -i- 1 I 2 . (66) 

/(0.1) = I z n + G0 n _ , - g, - g 2 - 1 I 2 . (67) 

X(Q.O) = i z n + G0 n _ , - g, - g 2 + 1 ! 2 . (68) 

25 where z„ are the samples obtained from the corresponding 4-tap prediction 

error filter connected in cascade with the equalizer (see Figure 2C). It is 
useful to define the quantities 



20 



30 



Z33 n = z n + g, + g 2 - 1 . (69) 



Z32 n = z n + g, + 9 2 + 1 • (70) 
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221 » = z n + Si - g 2 ~ 1 • (71) 

Z20 n = z„ + g, - g a + 1 . (72) 

Z13 n = z n - 9t + 9 2 ~ 1 • (73) 

Z12 n = *n ~ 9l + 9 2 -*- 1 • (74) 

Z01 n = Z n ~ 9l ~ 9 2 - 1 • (75) 

15 - Z00 n = z n - g, - g 2 4- 1 . (76) 

since (hey can be precompiled outside the feedback loop, if necessary by 
means of pipelining. Thus. eqs. (61) - (68) can be written as 



10 



20 



25 



30 



V.(3.3| = !Z33 n H- G3 n _ . ' 2 . (77) 

;.(3.2) = ! Z32 n - G3 n _ , i 2 . (78) 

/(2.1) = !Z21 n ^G2 n _, i 2 . (79) 

/(2.0) = i Z20 n - G2 n _ , \ 2 . (80) 

/(1.3) = !Z13 n -rG1 n ,,| 2 . (81) 
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/(1.2) = i Z12 n ^ G1 n _ , | 2 . (82) 

/(0.1) = iZ01 n + G0 n _ , I 2 . (83) 

5 

x(0,0) - I Z00 n + G0 n _ , | 2 . (84) 

respectively. Finally, defining the stored (present) metrics Ms tR . , for each of 
io the present states s t = 0. 1. 2, and 3. one obtains the trellis diagram shown in 
Figure 9. The four metrics for the next states St = 0. 1. 2. and 3. are updated 
according to 

Ms kn = min {Msj f + X(s y s k ); Ms } t -I- /(s if s k )} . (85) 

with s, and s, being the possible present states. Direct mapping of the trellis 
shown in Figure 9 into hardware functions leads to an implementation 
structure shown in Figures 11 A. 11B. and 11C. Note the similarity, 

20 respectively the differences, compared lo Figures 10A. 10B. 10C (size of 
predictor filter and RAM address length). The terms G0 o G1 n G2 n . i. and 
G3„ . ,. defined by (57) - (60), respectively, can be generated by means of RAM 
structures which can be loaded with appropriate values depending on the 
chosen operating point of the channel: the 4-state SPM can again be a 

25 register-exchange structure. Note that the squaring functions in (61) - (68) can 
be approximated to simplify the required circuitry, with minimal loss in 
performance. Four decision signals SO, S1, S2, and S3, are needed to control 
the metric multiplexers and the path update in the SPM. 

30 A multitude of variations for the implementation of the 4-state NPML detector 
with a 4-tap noise predictor is possible, depending on constraints on 
complexity, critical timing paths, and algorithmic issues such as metric 
bounding. For example, automatic metric bounding can be achieved by using 
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1 the conventional modulo technique in the adders feeding the comparator 
inputs. The alternate method of metric normalization, the difference metric 
technique introduced above, can be extended to the 4-state NPML detector, 
for example, by updating the metrics such that the stored metric of state 0 is 

5 always the zero-valued metric. Further variations for implementing (4-state) 
NPML detectors can be obtained by explicit expansion of the squaring 
function involved in evaluating the branch metrics A(s ( . s,). 

NPML Detector Using a Single-Tap Predictor (N = 1) 
io and Eight States (K = N + 2 = 3): 

It was pointed out hereinbefore that the 8-state NPML detector which uses a 
single-tap predictor (i.e.. the case where N = 1 and K=N+2=3 so that 2 K = 8 
states) is a member within the family of NPML detectors which is of specific 
is _P ractical interest for DASD applications. Since in this particular case there is 
no feedback based on past decisions, i.e.. the detector uses only 
(hypothesized) state information for noise prediction, the feedback loops via 
the FIR or RAM-based filters 64 and 65. as shown in Figure 4. are not present. 
For N = 1 and K = 3 the 16 branch metrics based on (5) become 

20 

/(S|, s k ) = | z n + a n _ lfll 4- a n _ 2 g 2 r a n 3 g 3 - a n | 2 (86) 

where the signal sample z„ = y P - p,y n Furthermore, since in (86) 

9' = Pi, g2 = 1. and g 3 = -p,. we can write 

25 

/(Sj. s k ) = .| z n -I a n _ lPl -h a n _ , - a n , Pl - a n \ 2 (87) 

where the triple (a„ ?. a^ 2 , a n , ) represents the hypothesized state s ( . a n is the 
30 hypothesized transmitted symbol, and the triple (a, \. a„ ; . a,) represents the 
resulting next state s„. In this situation it is advantageous to evaluate the 
square on the right hand side of (87). to drop all state-independent terms, and 
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i to scale the remaining expression. In this way. one arrives at the equivalent 
branch metric 

/'{Sj, s k ) = -[a n -a n _ 2 -(a n _ 1 -a n , 3 ) Pl ]z n 

- (a n a n _ , + a n _ 2 a n _ 3 ) Pl - a n a n _ 2 (88) 
+ (a n a n . 3 + a n .,a n . 2 )p, - a n ,a n _ 3 p? . 

We now arbitrarily use a somewhat different rule than the one used above to 
10 map the state information into the corresponding state number, namely, 

s, = (a n . j. a„ ? , a., . i ) = ( — 1. — 1. — 1) maps to the state 0, 

s, = (a„o. a« ?. a n i) = ( + 1 . — 1 . — 1 ) maps to the state 1. 

s, = (a n .. i, a n 2 . a„ ,) — (" + 1 . + 1 . -f 1 ) maps to state 7. Next, adding the. 

state-independent term (1 + p?) to all sixteen branch metrics represented by 
15 (88) and dividing the result by 2. the equivalent branch metrics can be listed 

as 

/"(0,0) = ;."(2,5) = /t"(5,2) = ;/'(7.7) = 0 (89) 

20 

;."(0.4) = /"(5.6) = - z n + 1 (90) 
/>"(1,0) = ;/'(3.5) = Pl (-z n + Pl ) (91) 

25 

A"(1.4) - oc( z n + a) (92) 
;/'(2.1) = A"(7,3) = z n -I- 1 (93) 

30 

/"(3,1) = //(z n + /0 (94) 
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/"K2) = /"(6.7) = p,Jz n ^ Pl l (95) 



/"(4.6) = //( - z n -!- //) (96) 

5 

/:"(6,3) = a(z n 4- .7) (97) 

where 

10 

7 = 1 + Pl . // = 2 - v = 1 - Pl . z n - y n - Pl y n _ , . (98) 

Defining the stored metrics Ms Jr , for the states s, = 0. 1 7. one arrives at 

l5 - the trel,i s diagram shown in Figure 12. The eight metrics Ms,„ for the next 
states s. = 0. 1 7. are updated according to 

Ms k| = min{M Sj ^/"(Sj.s^: M Sj ( 4- /"(s,. s k )} , (99) 

20 

with s. and s, being the states at time n-1. according to the trellis in Figure 12. 
The latter can in principle be mapped directly into a hardware structure. 



The trellis in Figure 12 can be further simplified by applying a similar 
transformation technique as described in the patent application GB-A-2286952. 
published on 30 August 1995: the resulting transformed trellis is shown in 
Figure 13 where 12 of the 16 branch metrics are zero-valued and the 
remaining four have values 2p. or -2p,. Defining the filtered samples 



30 Y n = -PiVn+ i + < 1 + Pi)y n - PiY„_ i (100) 

where y„ = a„ - a n . 2 -\r noise is the PR4-equalized. noisy sample, the 
quantities Z n and On shown in the trellis of Figure 13 can be expressed as 
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1 Z n = Y n + (1 -Pt) (101) 

and 

5 

On = "Y n + (1 + Pi) - -Z n ~ 2(1 + P1 2 ) . (102) 

respectively: if necessary, these quantities can be computed by pipelined 
circuitry since they are not part of the metric feedback loop. Direct mapping 
10 of the trellis in Figure 13 into a hardware structure ieads to the scheme 
shown in Figure 14: the eight decision signals SO - S"7 also control the 
operation of an 8-state SPM (register-exchange) not specifically shown. The 
SPM delivers the final decisions via the inverse precoder 

15 A significant feature of the NPML scheme described by Figures 12 - 14 is its 
ability to perform the detection function for arbitrary values of the noise 
predictor coefficient p,. Thus, by programming the hardware with the best 
suited predictor coefficient (depending on the channel operating point), 
optimal detection rs obtained within the constraints of a single-tap predictor. 

20 In particular, by setting p, = 0. the scheme performs detection for PR4 
signals, i.e., the hardware operates as a PRML detector. On the other hand, 
setting p, = -1. the scheme performs detection for EPR4 signals, i.e.. the 
hardware operates as an EPRML detector. The maximum required length of 
the SPM or. equivalently, the maximum decision delay for the final decisions, 

25 should be chosen such that the performance can be maintained for the most 
sensitive scheme (e.g. EPRML). 

For implementation purposes, it may be advantageous to modify the algorithm 
outlined for the flexible 8-state. single-tap predictor NPML scheme by adding 
30 a convenient, state-independent term to Z„ defined in eq. (101). e.g., such that 
Z,> — Z' n = Y„. It has been shown in the patent application GB-A-2286952. 
published on 30 August 1995. that the performance of the EPRML is not 
affected by such a measure since the channel is DC-free (spectral null at zero 
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frequency); this property extends to NPML detectors as well. Thus, an 
alternate version of the scheme in Figure 14 is obtained by modifying Z n and 
0" such that Z M -Z'. = Y n and 

°« -* = " Y » + 2 < 1 + Pi 5 ) = - Z'„ + 2(1 + p, T ). respectively. Note that the 
condition -Z„ + O n = Z'„ + Q\ = 2(1 + p, 2 ). must always be satisfied by 
theory. However, as described in the patent application GB-A-2286952, 
published on 30 August 1995. it may be advantageous in practice to modify 
this condition such that Z„ + Q n = Z' ft tQ' n = 2(1 + p,-| - where y is a 
small, positive constant: a practical value may be y 0.25. 

Alternate Forms of Implementation and Modifications: 

This section further demonstrates the multitude of forms of implementation 
which are possible for NPML detectors according to the present invention. 
Some alternate forms and simplifications of the detectors presented above are 
now described in some detail: 

2-State, 4-Tap Predictor NPML: 

Letting 



6 

G1' n _ , = g t 4- Ya, _ t il)g t . (103) 

i = 2 



G 0' n _ , = -g, - yV.jtOlg, 



(104) 



one obtains the four equivalent branch metrics 



=r. |z n + Gr n _,-i! 2 . (10 5) 
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/l(1.0) - |z n + G1' n _ , + 1 i 2 . (106) 



,1(0.1) - |z n + G0' n _i ~ 1 S 2 • ( 1 ° 7 > 



A(0,0) = |z n + G0' n _, + 1 | 2 , (108) 



where z n are the samples obtained from the corresponding 4-tap prediction 
1 <> error filter connected in cascade with the equalizer (see Figure 2C). It is 
useful to define the new quantities 

Z1 n = z n - 1 . (109) 



15 



20 



25 



30 



Z0 n -= z n + 1 . (110) 
so that eqs. (105) - (108) can be written as 

/(1.1) = i Z1 n - G1' n _ 1 f 2 . (1111 



A(1.0) - |Z0 n + G1' n _, : 2 . (112) 



;.(0.1) - |Z1 n + G0' n _ , . (113) 



X(0.0) = IZ0 n 4- G0' n „ ,1 | 2 . (114) 



The alternate form of implementing the functions of Figure 6 is shown in 
Figure 15. Here, it is proposed to generate the terms G1V, and GOV,, 
defined by (103) and (104), respectively, by means of random access memory 
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1 table look-up where the RAMs 121. 122 can be loaded with appropriate values 
(32 for each RAM) depending on the chosen operating point of the channel. 

The SPM 123 provides the five address bits a. : ( 1 ) a n . fi (1) and 

an-?(0) a„ *(0) for the RAMs 121 and 122. respectively, as indicated in 

5 Figure 15. 

Computation of the branch metrics for the difference metric approach 
(Figure 8) can be modified similarly; in this case, further simplifications are 
possible. For example, the potential difference metrics 

10 D„ = /(0.1) - M0.0) = -4(z„ i-GO', ,) and 

D„ = /.(1,1)- <(1.0) = -4(z., G1'„ ) which have to be precomputed. have 
simple expressions in terms of the signal sample z.. and the respective 
quantities generated by the RAMs. 

15 4-State. N-Tap Predictor NPML where N = 2 or 4 (Alternative 1): 
Letting 



20 



G3'.. 



N -t- 2 
i - 3 



,(3)ch 



(115) 



25 



N + 2 



G2' 



n - 1 



(116) 



i = 3 



N + 2 



G1 'n-I - - 91 + 9 2 + y a n - ,M)g, . ' (117) 



• = 3 

30 



N + 2 



G0 'n-1= -gi-9 2 + ya n-i (0|g, . ~ (118) 



i = 3 
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i one obtains the eight equivalent branch metrics 



;.(3.3) = i z n + G3' fl _ , -1| 2 . (119) 



/(3.2) = ! z n + G3' n _ , -f 1 I 2 . (120) 



10 



15 



x(2.1) - |z n + G2' n _, -1 . (121) 

/U2.0) = ! z„ + G2* n _ , +1 r . (122) 

;:(1.3) = |z n ^Gl' n _, -1 ! 2 . (123) 

^(1.2) = |z n + G1' p _, M ! 2 . (124) 

20 /(0.1) - ! z n -.- G0' o _ , -1 \ 2 . (125) 



/(0.0) =-- | z n + G0' n _ , +1 | 2 . (126) 

25 where z„ are the samples obtained from the corresponding N-tap prediction 
error filter connected in cascade with the equalizer (see Figure 2C). By 
making use of the definitions Z1„ = z„ - 1 and Z0.. -- z., + 1. respectively, eqs. 
(119) - (126) can be written as 

30 o 

,1(3,3) = |Z1 n +G3' n _ , | 2 . (127) 



;.(3.2) = |Z0 n + G3' n _ , I 2 . (128) 
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= IZ1 n + G2' n _ t !- 



(129) 



/(2.0) = | Z0 n 4- G2' 



n-H (130) 



/T1.3) = IZ1 n +G1V, :2 - (131) 



/(1,2) = iZ0 n + Gr n „, ! 2 - (132) 



/(0.1) ^ IZ^ + GO',,., r . (133) 



/(0.0) = !Z0 n -h G0' n _ , . (13 4) 

respectively, yielding again the trellis diagram shown in Figure 9. Direct 
mapping of this trellis into hardware functions - by using the new variables as 
defined above - leads to the structure shown in Figures 16A. 16B. 16C. The 
terms GO',. G!'„ G2\. .. and G3' defined by (115) - M18). respectively, 
can again be generated by means of RAMs 151-154 which can be loaded with 
the appropriate values depending on the chosen operating point of the 
channel: the 4-state SPM 155 - assuming again a register-exchange structure - 
provides the N address bits for each of the four RAMs (one per state); 
equivalent^, these four RAMs 151-154 can be combined into a single RAM 
structure with multiple inputs and outputs. Automatic metric bounding is 
achieved by using the conventional modulo-2 technique in the adders feeding 
the comparator inputs. 

NPML Detectors Implemented by Analog VLSI Technology: 

Implementation of any detector included within the family of NPML detectors 
can be done in either digital, analog, or mixed digital/analog VLSI circuit 
technology. Implementations in analog technology are of particular interest in 
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i high data rate and/or low-power applications. An example for PRML is 
described in 'Analog Implementation of Class-IV Partial-Response Viterbi 
Detector". A.H. Shakiba et al.. Proc. ISCAS'94. 1994: similar methods can be 
applied to NPML detectors. 

5 

NPML Detectors with Reduced SPM Length: 

It was indicated above, that NPML detectors generally do not exhibit 
quasi-catastrophic error propagation. This property can be exploited to save 
hardware and reduce decoding delay by reducing the length of the path 

10 

memory in the Viterbi detector without compromising performance. On the 
other, these hardware savings may be traded for additional increases in 
recording density by using run-length limited (RLL) codes with a higher rate 
than 8/9. since the code constraints relating to the length of the SPM can be 
relaxed. 

15 

4-State, N-Tap Predictor NPML where N = 2 or 4 (Alternative 2): 

Letting in (119) - (126) 

20 

G33' n _ , = G3' n _ t -1 (135) 
G32' n _ , = G3' n _ , -i-1 (136) 

25 

and so on, we can write the eight branch metrics as 

;.(3.3) = |z n + G33' n-1 | 2 . ' (137) 

30 

X{22) = iz n + G32' n _ , | 2 . (138) 

/(2.1) = | Zn + G21' n _, i 2 . (139) 
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;.(2,0) = |z n + G20' n _, j 2 . (140 , 



/(1.3) = Iz^Gir^, I 2 . f 14 1) 



x(1,2) = |z n -hG12' n _ t | 2 . {14 2) 



io ; < 01 > = !*n -GOr,^, r . (143) 



;.(0.0) - I z n G00' n _ , 1 ! 2 (144) 

t5 This version leads to the implementation shown in Figures 17A. 17B. 17C 
where the squaring function can be approximated as shown by A. Eshraghi et 
al., in "Design of a New Squaring Function for the Viterbi Algorithm". IEEE 
Journal of Solid State Circuits. Vol. 29. No. 9. September 1994. pp. 
1102 - 1107. 

20 



25 



30 
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i CLAIMS 

1. Apparatus for noise predictive maximum likelihood (NPML) sequence 
detection in a channel, comprising: 

5 a) a prediction error filter for whitening colored random noise 

. components of a sample y n received via said channel, said sample y n 
comprising a generalized partial-response signal component 
corrupted by said colored random noise components, leading to a 
signal z„ now having L intersymbol-interference components. 

io b) a sequence detector having 

• a state complexity being equal to 2" . with 0 < K < L and L 
reflecting the number of said intersymbol interference 
components, and 

• survivor path means for storing path history decisions 
15 corresponding to 2 K survivor paths. 

c) means for cancellation of L-K of said L intersymbol-interference 
components, said means comprising 

• feedback means for intersymbol-interference cancellation using 
precomputed and stored intersymbol-interference cancellation 

20 terms, and 

• means for retrieving said intersymbol-interference cancellation 
terms by applying said path history decisions as addresses to 
said feedback means for intersymbol-interference cancellation. 

25 

2. - The apparatus of claim 1, wherein said means for cancellation comprises 

at least one random access memory for storing said intersymbol 
interference cancellation terms, said random access memory being 
arranged such that intersymbol-interference cancellation terms are 
30 retrieved by applying a path history decision taken from said survivor 

path means as address to said random access memory. 
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1 3. The apparatus of claim 1. wherein said sample y n received via said 
channel is a partial-response signal and in particular a partial-response 
class-4 (PR4) shaped signal. 

5 4. The apparatus of claim 1, wherein said sequence detector is a Viterbi 
detector. 

5. The apparatus of claim 1, wherein said sequence detector is a 2-state 
sequence detector and said prediction error filter comprising a 4-tap 

io predictor. 

6. The apparatus of claim 1. wherein said sequence detector is a 4-state 
sequence detector and said prediction error filter comprising a 2-tap 
predictor. 

15 

7. The apparatus of claim 1. wherein said sequence detector is a 4-state 
sequence detector and said prediction error filter comprising a 4-tap 
predictor. 

20 8. The apparatus of claim 1. wherein said sequence detector is a 8-state 
sequence detector, preferably a programmable one. and said prediction 

error filter comprising a 1-tap predictor. 

9. The apparatus of claim 1. either comprising a separate inverse precoder 
25 fed by the output of said detector, or means for imbedding the inverse 

precoder function into said sequence detector. 

10. The apparatus of claim 1. having a transfer function for the signal portion 
of said sample y., being different from the transfer function for said 

30 colored random noise components. 
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1 11. The apparatus of claim 2. wherein said prediction error filter and/or said 
random access memory has a non-linear transfer characteristic. 

12. The apparatus of claim 2. wherein said prediction error filter and/or said 
5 random access memory is programmable. 

13. The apparatus of claim 12. comprising means for adaptive setting of said 
programmable prediction error filter and/or said random access memory 
such that its characteristic automatically adjusts as the colored random 

10 noise on said data channel changes. 

14. The apparatus of claim 1 or 2 being either completely or partially 
implemented in analog circuit technology- 
is -15. The apparatus of claim 1 or 2. wherein said feedback means comprise a 

feedback finite-impulse response (FIR) filter. 

16. The apparatus of claim 1. comprising: 

• a memoryless detector for determining a nominal expected value, 

20 • means for estimating the noise contribution in a plurality of past 

digital samples by subtracting the value of a sample from said 
nominal expected value. 

• means for predicting the noise contribution of the current received 
sample using the noise contribution in a plurality of said past digital 

25 samples, 

• means for adding or subtracting the predicted noise contribution 
to/from the current received sample, and 

• means for feeding the output of the means for adding or subtracting 
to a conventional partial response maximum likelihood (PRML) or 

30 extended partial-response maximum likelihood (EPRML) detector. 
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■17. The apparatus of any of the claims 1-16, wherein said channel is a data 
transmission channel and said apparatus is employed for estimation of 
data received via said data transmission channel. 

18. Direct access storage device, in particular a disk drive, comprising direct 
access storage means and an apparatus for noise predictive maximum 
likelihood (NPML) sequence detection according to any of the claims 1 - 
16. said channel being a storage channel for feeding signals retrieved 
from said direct access storage means to said apparatus. 

19. The apparatus of any of the claims 1-16, being integrated into 
(piggy-backed) on a partial-response maximum likelihood (PRML) or 
extended partial-response maximum likelihood (EPRML) system. 



-20. The apparatus of claim 19, wherein a digital equalizer, being part of said 
partial-response maximum likelihood (PRML) or extended 
partial-response maximum likelihood (EPRML) system, and said prediction 
error filter are replaced by a single finite-impulse response filter having 
the property to whiten said colored random noise components of said 
20 sample y n . 

21. The apparatus of any of the claims 1-16. being connected to a partial 
response maximum likelihood (PRML) or an extended partial-response 
maximum likelihood (EPRML) detector such that one can switch from a 
first state where the apparatus and either one of said detectors operate 
concurrently to a second state where either said partial response 
maximum likelihood (PRML) or extended partial-response maximum 
likelihood (EPRML) detector, or said apparatus processes said sample y n 
received via said channel. 



22. Method for noise predictive maximum likelihood (NPML) sequence 
detection by means of a sequence detector having a state complexity 
being equal to 2 K . with 0 < K ^ L. said method comprising the steps: 
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i a) whitening colored random noise components of a sample y n received 

via a channel, said sample y tl comprising a generalized partial 
response signal component corrupted by said colored random noise 
components. leading to a signal z.» then having L 

5 intersymbol-interference components. 

b) eliminating K of said L intersymbol-interference components by 
carrying out a branch metric computation based on. a 2*-State Viterbi 
algorithm to determine the most likely sequence corresponding to 
said sample y n , and 

io c) if there are any intersymbol-interference components left. i.e.. if 

L - K > 0. 

• precomputing intersymbol-interference cancellation terms. 

• storing said intersymbol-interference cancellation terms in 
storage means, 

15 * • retrieving said intersymbol-interference cancellation terms from 

said storage means by applying path history decisions from said 
sequence detector as addresses to said memory means, 

• cancelling said L-K intersymbol-interference components in said 
signal z„ using said intersymbol interference cancellation terms. 

20 

23. The method of claim 22, comprising the steps: 

• estimating the noise contribution in a plurality of past digital samples 
by subtracting the value of a sample from a nominal expected value, 
said nominal expected value being determined by simple 

25 memoryless detection, 

• using the noise contribution in a plurality of said past digital samples 
to predict the noise contribution of the current received sample, 

• adding or subtracting the predicted noise contribution to/from the 
current received sample, and 

30 • feeding the output of the last step to a conventional partial response 

maximum likelihood (PRML) or extended partial-response maximum 
likelihood (EPRML) detector. 
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i 24. The method of claim 22. wherein said sample y, received via said channel 
is a partial-response signal and in particular a partial-response class-4 
(PR4) shaped signal. 
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